Host stop-transmission handling

ABSTRACT

A memory system includes a controller and a memory array that stores partial-page data and complete-page data in separate areas. Data received from a host is sent from a memory controller to an on-chip cache prior to determining whether the data is partial-page data or complete-page data. After a determination is made, the data is stored at an address in the corresponding area.

BACKGROUND OF THE INVENTION

This invention relates to nonvolatile memory systems and methods of operating nonvolatile memory systems.

Nonvolatile memory systems are used in various applications. Some nonvolatile memory systems are embedded in a larger system such as a personal computer. Other nonvolatile memory systems are removably connected to a host system and may be interchanged between different host systems. Examples of such removable memory systems include memory cards and USB flash drives. Electronic circuit cards, including non-volatile memory cards, have been commercially implemented according to a number of well-known standards. Memory cards are used with personal computers, cellular telephones, personal digital assistants (PDAs), digital still cameras, digital movie cameras, portable audio players and other host electronic devices for the storage of large amounts of data. Such cards usually contain a re-programmable non-volatile semiconductor memory cell array along with a controller that controls and supports operation of the memory cell array and interfaces with a host to which the card is connected. Several of the same type of card may be interchanged in a host card slot designed to accept that type of card. However, the development of the many electronic card standards has created different types of cards that are incompatible with each other in various degrees. A card made according to one standard is usually not useable with a host designed to operate with a card of another standard. Memory card standards include PC Card, CompactFlash™ card (CF™ card), SmartMedia™ card, MultiMediaCard (MMC™), Secure Digital (SD™) card, a miniSD™ card, Subscriber Identity Module (SIM), Memory Stick™, Memory Stick Duo card and microSD/TransFlash™ memory module standards. There are several USB flash drive products commercially available from SanDisk Corporation under its trademark “Cruzer®” USB flash drives are typically larger and shaped differently than the memory cards described above.

Different types of flash memory array architecture are used in nonvolatile memory systems. In one type of architecture, a NAND array, strings of more than two memory cells, such as 16 or 32, are connected along with one or more select transistors between individual bit lines and a reference potential to form columns of cells. Word lines extend across cells within a large number of these columns.

An individual flash memory cell may hold one bit of data in what is known as a Single Level Cell (SLC) memory. In some examples, a memory cell may hold two or more bits of data in what is known as a Multi Level Cell (MLC) memory.

SUMMARY OF THE INVENTION

A nonvolatile memory system according to an embodiment of the present invention stores complete-page data in one portion of a memory array and stores partial-page data in a different portion of the memory array. Data is sent from a controller to an on-chip cache prior to determining which of these areas the data is to be stored in. Then, after the data is stored in the on-chip cache, a determination is made and the data is written to the appropriate location.

According to an embodiment, a method of managing storage of data in a memory array that separately stores complete-page data and partial-page data comprises: storing received data in an on-chip cache in a peripheral area of the memory array prior to writing the received data to the memory array; and when a stop-transmission command is received prior to writing the received data to the memory array, and the received data is less than the data of a complete page, writing the received data to an area of the memory array that is dedicated to storage of partial-page data.

The stop-transmission command may be a Secure Digital stop-transmission command from a host. The received data may be initially assigned to a first location in the memory array that is dedicated to storage of complete-page data, and when the stop-transmission command is received, the data is assigned to a second location in the area of the memory array that is dedicated to storage of partial-page data. The data may be assigned to the second location by a memory controller or an on-chip circuit that is peripheral to the memory array. The memory array stores data in multi-level cells that individually store more than one bit of data. The memory array may comprise two or more planes that are programmed in parallel. The area of the memory that is dedicated to storage of partial-page data may extend across the two or more planes or may be limited to one plane of the two or more planes.

According to an embodiment, a method of managing streams of data in a nonvolatile memory array that does not permit more than one programming operation per page without erase comprises: maintaining a first area of the nonvolatile memory array for storage of complete pages of data; maintaining a second area of the nonvolatile memory array for storage of partial pages of data; storing a portion of data in a page buffer in the memory periphery before a determination is made whether to store the portion of data in the first area or the second area; subsequently, determining whether to store the portion of data in the first area or the second area; and writing the portion of data from the page buffer to the first area or the second area according to the determination.

The determining may be performed in response to receiving a stop-transmission command. The stop-transmission command may be a Secure Digital stop-transmission command received from a host. The nonvolatile memory array may comprise two or more planes and the second area may extend across all of the two or more planes, or the second area may be contained within one of the two or more planes. Determining whether to store the portion of data in the first area or the second area may further comprise determining whether the data is complete-page data or partial-page data. Determining whether the data is complete-page data or partial-page data may further comprise detecting whether a stop-transmission has been received.

According to an embodiment, a method of handling streaming data in a buffered nonvolatile memory comprises: receiving a stream of data from a host without a prior indication of the amount of data; copying a portion of the stream of data from a memory controller to an on-chip cache prior to identifying the portion of the stream of data as partial page data; while the portion of the stream of data is in the on-chip cache, receiving an indication from the host that requires the portion of the stream of data to be written to the memory array; in response to receipt of the indication, determining that the portion of the stream of data is partial-page data; in response to the determination, selecting an area of the memory array that is dedicated to storage of partial-page data; and writing the portion of the stream of data to the area.

The indication may be a Secure Digital stop-transmission command. The memory array may comprise two or more planes and the area may extend across all of the two or more planes, or the area may be limited to fewer than all of the two or more planes. The area may be limited to one of the two or more planes.

According to an embodiment, a nonvolatile memory system comprises: a nonvolatile memory array that includes a first area for writing complete pages and a second area for writing partial pages; and an on-chip cache in a peripheral area of the memory array, the on chip cache storing data to be written to the nonvolatile memory array prior to determining whether to write the stored data to the first area or the second area.

The memory array may comprise two or more planes. The second area may extend across all of the two or more planes, or the second area may be contained within one of the two or more planes. The nonvolatile memory system may further comprise a memory controller that sends data to be stored in the nonvolatile memory array, the memory controller in communication with a host. The nonvolatile memory system may further comprise a memory card physical interface for communication with the host. The physical interface may be a Secure Digital interface and communication with the host may communication according to the Secure Digital standard. The on-chip cache may be configured to determine whether to write the stored data to the first area or the second area in response to receipt of a Secure Digital stop transmission command from the host.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows programming of an SLC memory.

FIG. 2A shows a first stage of programming an MLC memory.

FIG. 2B shows a subsequent second stage of programming the MLC memory of FIG. 2A.

FIG. 3 shows a memory system in communication with a host.

FIG. 4 shows a memory chip with two planes.

FIG. 5 shows a memory array with an area for storage of partial-page data and a separate area for storage of complete-page data.

FIG. 6A is a timing diagram for a data transfer scheme.

FIG. 6B is a timing diagram for an alternative data transfer scheme.

FIG. 7 shows programming of partial-page data in a two-plane memory array.

FIG. 8A shows a timing diagram for an example of partial-page programming.

FIG. 8B shows a timing diagram for another example of partial-page programming.

FIG. 9A shows an example where all partial-page data is stored in one plane of a two-plane memory array.

FIG. 9B shows an example where data is relocated through controller RAM.

FIG. 10 shows a flowchart of a data storage scheme.

DETAILED DESCRIPTION OF ILLUSTRATED EMBODIMENTS

It is common in current commercial products for each storage element of a flash EEPROM array to store a single bit of data by operating in a binary mode, where two ranges of threshold voltage of the storage element transistors are defined as two memory states. The threshold voltages of transistors correspond to ranges of charge levels stored on their storage elements. In addition to shrinking the size of the memory arrays, the trend is to further increase the density of data storage of such memory arrays by storing more than one bit of data in each storage element transistor. This is accomplished by defining more than two threshold voltage levels as memory states for each storage element transistor, four such states (2 bits of data per storage element) being used in one example. More storage states, such as 16 states (4 data bits) per storage element may also be used. Each storage element memory transistor has a certain total range (window) of threshold voltages in which it may practically be operated, and that range is divided into the number of states defined for it plus margins between the states to allow for them to be clearly differentiated from one another.

As the number of states stored in each memory cell increases, the tolerance of any shifts in the programmed charge level on the storage elements decreases. Since the ranges of charge designated for each memory state must necessarily be made narrower and placed closer together as the number of states stored on each memory cell storage element increases, the programming must be performed with an increased degree of precision and the extent of any post-programming shifts in the stored charge levels that can be tolerated, either actual or apparent shifts, is reduced. Actual disturbs to the charge stored in one cell can be created when programming and reading that cell, and when reading, programming and erasing other cells that have some degree of electrical coupling with that cell, such as those in the same column or row, and those sharing a line or node.

Apparent shifts in the stored charge levels occur because of field coupling between storage elements. The degree of this coupling is necessarily increasing as the spaces between memory cell storage elements are being decreased, which is occurring as the result of improvements of integrated circuit manufacturing techniques. The problem occurs most pronouncedly between two groups of adjacent cells that have been programmed at different times. One group of cells is programmed to add a level of charge to their storage elements that corresponds to one set of data. After the second group of cells is programmed with a second set of data, the charge levels read from the storage elements of the first group of cells often appear to be different than programmed because of the effect of the charge on the second group of storage elements being capacitively coupled with the first. This is known as the Yupin effect, and is described in U.S. Pat. No. 5,867,429. This patent describes either physically isolating the two groups of storage elements from each other, or taking into account the effect of the charge on the second group of storage elements when reading that of the first group. Various programming schemes may be used to reduce Yupin effect. In particular, programming of MLC memory may be done in stages, a first stage is performed to bring a group of memory cells close to their desired charge levels. Then, only after neighboring cells have undergone at least a first stage, a second stage is performed to bring the cells to their desired levels. Thus, the final charge levels reflect changes caused by programming of neighboring cells.

Because of the higher precision required in programming MLC memory, more time is generally needed than for programming SLC memory. Also, programming in multiple steps to reduce apparent shifts in charge levels may take more time. This means that MLC storage, though more efficient in using space in a memory array, may be slower than SLC memory, at least for programming. In order to take advantage of the storage efficiency of MLC memory and the speed of SLC memory, data may initially be written to SLC memory and later copied to MLC memory. Once all data from an SLC block is copied to an MLC block, the SLC block may be erased so that it becomes available for subsequent use.

FIG. 1 shows programming of a SLC memory. The threshold voltage (V_(T)) of a cell is mapped to two distinct ranges representing two memory states and threshold voltages of individual cells are represented by distributions 10, 12 shown for logical state 1 and logical state 0 respectively. An erased memory cell represents a logic 1 state. A programmed cell represents a logic 0 state. Generally, memory cells are erased together in a unit of a block (erase block) so that all cells in the block are brought to the logic 1 state together. Cells are generally programmed together in units of a page, where a block consists of one or more pages. In one example, a NAND flash memory consists of blocks of cells, with each block consisting of multiple pages that are formed by rows of memory cells. Examples of NAND flash memory systems and their operation are provided in U.S. Pat. No. 7,061,798. During programming, cells that are to hold a logical 0 have their threshold voltage increased as shown, while cells that are to hold a logical 1 remain at their erased threshold voltage. Programming may include a series of pulse and verify steps. Once the cells are programmed, the state of a cell is read by comparing the threshold voltage of the cell to a discrimination voltage (V_(D)). Thus, any cell having a threshold voltage less than V_(D) is read as storing a 1, and any cell having a threshold voltage greater than V_(D) is read as storing a 0.

FIG. 2A shows a first stage of programming MLC memory. In particular, FIG. 2A shows the threshold voltage of a cell having four different threshold voltage ranges assigned to four different memory states 33-36. The erased state 33 has the lowest threshold voltage range, indicated by a threshold voltage less than 0 volts in this example. The first programming stage of FIG. 2A programs some cells to the next state, state 34 and leaves others in erased state 33. This programming stage (first pass) generally programs cells of a page of memory together and the bits programmed in this way may be considered as a logical page (lower page). Thus, after the first pass programming of FIG. 2A a first logical page has been programmed and all cells of the physical page in the memory array are either in the erased state 33 or first programmed state 34. Programming may be achieved by a series of programming pulses with verification performed between pulses.

FIG. 2B shows a second programming stage (second pass) that is performed subsequent to the first stage of FIG. 2A. In this stage, cells are programmed according to bits of a second logical page (upper page). For cells storing a 1 in the upper page, the memory state remains in erased state 33 or first programmed state 34. For cells storing a 0 in the upper page, the memory state is programmed to second programmed state 35 or third programmed state 36 as shown. Cells in erased state 33 are programmed to third programmed state 36 and cells in first programmed state 34 are programmed to second programmed state 35 as shown. Thus, after the second programming stage, a cell may be in any one of four possible memory states as determined by an upper page bit and a lower page bit as indicated by bits shown in distributions of cells in states 33-36. It will be understood that FIGS. 2A and 2B show an exemplary scheme and other schemes may also be used to program MLC memory.

While storing larger amounts of data per unit area in a memory array is achievable using MLC as compared with SLC, reducing the speed of programming is generally not desirable and may not be acceptable for certain applications. In particular, for removable mass storage applications (e.g. in flash memory cards or USB flash drives), hosts may require data to be stored within a specified maximum time. In order to take advantage of the storage efficiency of MLC without suffering a time penalty, data may initially be stored in SLC and later stored in MLC at a time when resources are available, e.g. data may be moved to MLC as a background operation. When the data is stored in SLC, an indication may be sent to the host indicating that the data is stored. Thus, the host sees data storage taking place at the speed of SLC storage. Subsequent storage in MLC may be transparent to the host. As long as transfer of data from SLC memory to MLC memory takes place in a timely manner, the extra space occupied by data in SLC memory may not have a significant impact.

Many memory chips have some form of latches or registers that hold data prior to, or during, programming of data to the memory array. Such latches may be used to as an on-chip cache to provide faster transfer of data. Examples of such on-chip cache are provided in US Patent Application Publication No. 2006/0136656, which is hereby incorporated by reference for all purposes. Additional examples of how data latches may be used for caching data on a memory chip are provided in U.S. Pat. No. 7,505,320, which is hereby incorporated by reference for all purposes.

FIG. 3 shows an example of a memory system 300 that may be embodied in a memory card, USB flash drive, or other removable nonvolatile memory system. FIG. 3 shows communication to and from a host, which generally occurs through a standardized interface 302. Such an interface is generally established by a standard that dictates the physical arrangement of contacts and the details of how communication is to occur between the host and memory system. The memory system 300 includes a controller 304 that manages communication with the host and also manages how data is stored in the memory array 306. In a typical arrangement, the memory controller determines where particular data is stored in the memory array and maintains a logical-to-physical mapping that relates logical addresses provided by the host to physical addresses in the memory array. FIG. 3 shows the controller 304 on a separate chip to the memory array 306, though in some examples, the controller may be on the same chip as the controller. The memory chip 308 includes two sets of data latches 310, 312 that may be considered together as an on-chip cache 314. “On-chip” here indicates that the data latches are on the same chip as the memory array, and the data latches may be considered as peripheral circuits on the memory chip 308. The on-chip cache may also be considered to be a buffer for the memory array 306, and a memory array having such a buffer may be referred to as a buffered memory array. Data latches 310, 312 are connected to bit lines so that the data to be written to a page is stored in a set of data latches which form a page buffer. Thus, the sets of data latches shown form two page buffers 310, 312, each of which contains data to be written to one page of the memory array 306. In some examples, data for a lower page may be held in one page buffer while data for an upper page is held in another page buffer. The controller 304 also includes a volatile memory (controller RAM 316) which may be used to store data from the host prior to storing it in the memory array.

In a typical arrangement, the memory array is programmed in a unit called a page which extends along a word line. In such an arrangement, a page forms the minimum unit of programming. Because each cell holds more than one bit, a word line holds more than one page. For example, where cells of a word line each store two bits of data, the word line stores two pages of data, commonly referred to as lower-page and upper-page data. A page may be programmed once with data. However, if the page is subsequently programmed with additional data (without first erasing the original data) the original data may be corrupted by the subsequent programming. Because of the risk of such corruption, a memory array may be operated so that once a page is written, subsequent writing to that page is prohibited even if the page is not full. Dummy data may be used to fill up the unused portion of such a page. However, storing less than a full page of data to a page is inefficient, and it is generally desirable to combine such partial-page data with other partial-page data so that the memory array is more efficiently used. In some memory arrays, a portion of the memory array is dedicated to storage of partial-page data. Such partial-page data may later be relocated if additional data is received to form a complete-page, or if partial-pages can be combined in a manner that forms complete-pages. Such a dedicated area may include one or more blocks. The physical area that is dedicated for partial-page storage may be changed for wear-leveling purposes. Another portion of the memory array (other than the portion that is dedicated to storage of partial-page data) may be dedicated to storage of complete-page data. Complete-page storage commonly occupies the majority of the memory array. Additional portions of a memory array may be dedicated to storage of management data (such as FAT and directory information) or other purposes.

Memory cells of a typical flash EEPROM array are divided into discrete blocks of cells that are erased together. That is, the block is the erase unit, a minimum number of cells that are simultaneously erasable. Each block typically stores one or more pages of data, the page being the minimum unit of programming and reading, although more than one page may be programmed or read in parallel in different sub-arrays or planes. Each page typically stores one or more sectors of data, the size of the sector being defined by the host system. An example sector includes 512 bytes of user data, following a standard established with magnetic disk drives, plus some number of bytes of overhead information about the user data and/or the block in which they are stored. Such memories are typically configured with 16, 32 or more pages within each block, and each page stores one or just a few host sectors of data.

In order to increase the degree of parallelism, and thus improve performance, during programming user data into the memory array and reading user data from it, the array is typically divided into sub-arrays, commonly referred to as planes, which contain their own data registers and other circuits to allow parallel operation such that sectors of data may be programmed to or read from each of several or all the planes simultaneously. An array on a single integrated circuit may be physically divided into planes, or each plane may be formed from a separate one or more integrated circuit chips. Examples of such a memory implementation are described in U.S. Pat. Nos. 5,798,968 and 5,890,192.

To further efficiently manage the memory, blocks may be linked together to form virtual blocks or metablocks. That is, each metablock is defined to include one block from each plane. Use of the metablock is described in U.S. Pat. No. 6,763,424, which patent, along with all other patents and patent applications referred to in this application, is hereby incorporated by reference. The metablock is identified by a host logical block address as a destination for programming and reading data. Similarly, all blocks of a metablock are erased together. The controller in a memory system operated with such large blocks and/or metablocks performs a number of functions including the translation between logical block addresses (LBAs) received from a host, and physical block numbers (PBNs) within the memory cell array. Individual pages within the blocks are typically identified by offsets within the block address. Address translation often involves use of intermediate terms of a logical block number (LBN) and logical page.

FIG. 4 shows a memory array 420 that is formed as two planes 422 a, 422 b, each having an array portion and an on-chip cache portion. The present application uses the term “page” to refer to the unit of programming of the memory array, which may be a page of a single plane (sometimes referred to as a “die-page”) where one page is programmed at a time, or a meta-page that extends across multiple planes where planes are programmed in parallel. Thus, the term page buffer may refer to the data latches that hold data to be programmed to a meta-page rather than just the data latches that hold data for the portion of the meta-page in a single plane. For example, data latches 424 a, 424 b form a page buffer in FIG. 4 because data latches 424 a, 424 b together hold all the data for a page that extends across both planes. In general, techniques described here with respect to a page will be understood to apply equally to a single die-page within a plane, or to a page (metapage) that extends across two or more planes of a memory array. The memory array of FIG. 4 may include a portion that stores partial-page data and another portion that stores complete-page data. In some arrangements, the portion that is dedicated for storage of partial-page data extends across both planes, while in other arrangements, the portion that is dedicated for storage of partial-page data is limited to one plane.

One feature of removable memory systems such as USB flash drives and flash memory cards is that they may be removed from a host and may interface with different hosts at different times. Generally, a removable memory system should be placed in a safe condition prior to being removed from a host or losing power from the host. Some host interface standards require that all data be saved to nonvolatile memory (not just cached in a volatile memory) in response to particular conditions so that such a removable memory system is in condition to be powered down or removed. An example is a “stop transmission” command used in the Secure Digital standard, which requires that all data be guaranteed by the card. When such conditions occur, any data that is stored in a volatile memory (such as controller RAM or on-chip cache) is written to the memory array. Complete-page data is written to the area of the memory array that is dedicated to storage of complete-page data. Partial-page data is written to the area of the memory array that is dedicated to storage of partial-page data. However, where data is sent by a host without any prior indication of the amount of data that is being sent, it is not known beforehand whether the data will be complete-page data or partial-page data. For example, where a host sends streaming data (without any prior indication of the amount of data), the memory system receiving such data does not know whether a particular portion of the streamed data is going to be complete-page data or partial-page data until either enough data is received to fill a page (complete-page data), or conditions require saving the data prior to receipt of enough data to fill a page (partial-page data) e.g. because the host sends a stop transmission command.

One approach to handling such data is to store data in controller RAM until it is established whether the data should be saved as partial-page data or complete-page data. Then, when the controller establishes where the data is to be stored, the controller selects an address in the corresponding portion of the memory array (either in the partial-page portion, or full-page portion), and sends the data to be stored at that address. In this approach, data is maintained in controller RAM until a determination is made as to whether the data is partial-page data or complete-page data. However, this may require an undesirably large controller RAM (holding at least a page of data), which increases the cost of the controller. Also, there is some delay in transferring data to the memory array because the data is held in the controller RAM until a complete page is available, or programming of a partial-page is required. This may have a negative impact on system performance which may be especially significant where a host operates in small command sizes (such as 16 KB or 32 KB in SD Speedclass, for example).

Another approach is to temporarily store the data in an intermediate storage (IS) area. For example, the data may be stored in SLC initially and later stored in MLC when enough additional data is received to form complete-page data. However, this approach requires additional SLC blocks in the memory array, which increases the cost of the memory array. Also, management of such an intermediate storage area adds to system complexity and may not be desirable in low cost systems.

According to an embodiment of the present invention, data is sent from the controller to the memory chip prior to determining whether the data is partial-page data or complete-page data. Such data is then held in on-chip cache until a determination is made, and in response to the determination the data is stored in the appropriate location, either in the partial-page area or the complete-page area of the memory array. This means that the controller RAM can be relatively small because it does not have to hold data until the determination is made. Instead, the controller RAM sends the data to the on-chip cache prior to making any determination. The data may initially be assigned to a default destination. For example, the data may be assigned by default to the complete-page area. However, the data is not written to the memory array until a determination is made. The data is stored in on-chip cache and is only written to the memory array after enough data is received to form a complete-page, or some condition occurs which requires the data to be written as partial-page data.

FIG. 5 shows an example of a memory system 530 which stores complete-page data in one area of a memory array 532 and stores partial-page data in another area of the memory array 532. When data is received from a host without an indication of the amount of data being sent, the data is transferred from the controller to the on-chip cache 536 before determining if the data is complete-page data or partial-page data. If sufficient data is received to fill a page (i.e. fills a page buffer in the on-chip cache 536), then the data is written 537 to the complete-page area 532. If a condition occurs that requires storing the data to the nonvolatile memory prior to receipt of a full page of data, then the data is written 538 to the partial-page area 534.

FIG. 6A shows a timing diagram for a scheme in which data is held in controller RAM until a determination is made as to whether the data is complete-page data or partial-page data. Here, the host transfers 16 KB of data 640, which is held in controller RAM, thus setting a minimum size requirement for controller RAM. Once an entire 16KB is received, the controller determines that the data is complete-page data (because the page size is 16 KB in this example), so the data is transferred to the on-chip cache 642. From the on-chip cache, the data is written to the memory array 644.

In contrast, FIG. 6B shows a timing diagram for a scheme in which data is transferred to the on-chip cache prior to making a determination as to where to store the data. In this example, the data is transferred from the controller to the on-chip cache in 4 KB portions 646 a-d, though in other examples data may be transferred in different sized chunks. Because data is transferred from the controller RAM as soon as 4 KB is received from the host, the minimum controller RAM required is 4 KB. Also, after sufficient data to fill a complete page is received by the controller, writing the data may begin sooner because most of the data (three of four 4 KB chunks 646 a-c) have already been transferred to the on-chip cache. Thus, there is a time saving 650 so that the scheme of FIG. 6B reduces cost and improves performance compared with the scheme of FIG. 6A.

In a multi-plane design, partial-page data may extend over any number of planes. For example, in the two-plane example of FIG. 7, the partial page data may all be in plane 0, or may extend to plane 1. The handling of partial-page data may be different depending on where the break occurs in the data. Where all the data to be stored is less than a die-page and thus corresponds to just one plane, all the data is stored in the partial-page area. However, where the data to be stored extends over more than one plane, the data may not all be stored in the partial-page area. For example, FIG. 7 shows data in the on-chip cache 754 that corresponds to a complete die-page in plane 0 and a partial die-page in plane 1. In this arrangement, cached data may be programmed 756 to the complete-page area in plane 0, and programmed 758 to partial-page area in plane 1.

FIG. 8A shows a timing diagram for a scenario where a stop transmission command 860 is received at a point where all the cached data to be written is less than a die-page. Here, the stop transmission command is received after 7 KB of host data is received (where a complete page holds 16 KB, and each die-page holds 8 KB). 4 KB of data 862 is already in the on-chip cache at this point, and the remaining 3 KB is immediately transferred from the controller RAM to the on-chip cache 864. Because the data is less than the data of a complete page and must be written immediately, it is assigned to the partial-page area. In this example, the default assignment is to the complete-page area, so an address change is made 866. The address change may be performed by the memory controller or may be performed by peripheral circuits on the memory chip. The data is then programmed to the partial-page area of the memory array in plane 0. Because there is no data to be written in plane 1, programming of plane 1 may be inhibited during programming of this data. In another example, instead of redirecting the data in the on-chip cache to a different portion of the memory array, the data is read back to the memory controller and is then written to another portion of the memory array.

FIG. 8B shows a timing diagram for a scenario where a stop transmission command 868 is received at a point where the cached data extends over more than a single die-page. In this case 14 KB are received (where each die-page holds 16 KB) prior to receipt of the stop transmission command 868. When the stop transmission command is received 868, all data remaining in the controller RAM (2 KB in this example) is transferred to on-chip cache 870, which already contains 12 KB. In this example, there is sufficient data to fill a die-page in plane 0, so no address change is performed 872 and the data in plane 0 is programmed to its default destination 874 (i.e. to the complete-page portion of the memory array). Programming of plane 1 is inhibited during programming of plane 0. Subsequently, an address change is performed 876 and the data in on-chip cache to be programmed to plane 1 is programmed to the partial-page area 878. Thus, these two operations are performed sequentially. While this example (and some other examples presented here) show writing of upper page data, aspects of the present invention may also be applied to writing of lower page data. In some examples, partial page data is written to lower pages, while full page data is written to upper pages in the same physical portion of the memory array. Thus, the division of the memory array between a complete-page area and a partial-page area is not necessarily a physical division where cells store only one or the other. Cells may store partial-page data in one logical page while storing complete-page data in another logical page.

While the above examples show a partial-page area that extends across all planes of the memory array, in some cases the partial page area may be limited to a subset of all planes (as few as one plane). In the example of FIG. 9A, the partial-page area is limited to plane 0, so that the entire partial-plane area of the memory area is contained in plane 0. In this example, any partial-page data that is held in the on-chip cache 980 to be written to plane 1 is instead written to the partial-page area in plane 0. In contrast to the scheme of FIG. 7, in this example data in the on-chip cache is not written to the same plane but is relocated prior to writing. Starting with 12 KB in the on-chip cache (as shown in FIG. 7), the data in the plane 0 portion of the on-chip cache is written to the complete-page portion of plane 0 982. Next, instead of writing data in the plane 1 portion of the on-chip cache 980 to plane 1, this data is moved 984 to the plane 0 portion of the on-chip cache. This data is then written 986 to the partial-page portion of the memory array (which is in plane 0).

While the example of FIG. 9A shows data being relocated within on-chip cache without sending data to the controller, in some cases the controller may be used in transferring data. For example, data may be transferred from one chip to another chip where it is written. In this case, the data may be sent to controller RAM, and then sent from controller RAM to the chip where it is to be stored. FIG. 9B shows an example where on-chip cache 901 of chip 2 contains partial-page data 903 when a stop transmission command is received. In this example, it is desirable to write the partial-page data 903 to another chip (chip 1). This may be because chip 2 only stores complete-page data, or because any partial-page storage area on chip 2 is unavailable, or for some other reason. Partial-page data 903 is sent 904 to controller RAM 905 in controller 907. Then, partial-page data 903 is sent 908 from controller RAM 905 to on-chip cache 909 in chip 1. Partial-page data 903 is then written 911 to the memory array in chip 1.

FIG. 10 shows a flowchart for data storage according to some embodiments of the present invention. A stream of data is received 188 from a host and is sent 190 from the memory controller to an on-chip cache in chunks that are smaller than a page. The data in the on-chip cache may be assigned to a default address in the complete-page portion of the memory array. Whenever a sufficient number of chunks are received by the on-chip cache to fill a page, the cached data is written 192 to an address in the complete-page portion of the memory array. If no stop transmission command is received 194, the memory system continues to write data to the memory array as complete pages are received. If a stop transmission command is received at any point 194, then a determination is made as to whether on-chip cache contains data of a full page 195. If the data in on-chip cache is enough for a full page, then the data is written to the full page area 196. If the data in on-chip cache is less than a full page then it is written to the partial-page area of the memory array 198 This means that the default address in the complete-page area is replaced by an address in the partial-page area. In some cases, this means that all cached data is written to the partial-page area, while in other cases, cached data in some planes may be written to the complete-page area.

Although the various aspects of the present invention have been described with respect to certain preferred embodiments, it is understood that the invention is entitled to protection within the full scope of the appended claims. Particular examples show one or two planes in a memory array, though aspects of the invention apply to memory arrays using different numbers of planes.

All patents, patent applications, articles, books, specifications, other publications, documents and things referenced herein are hereby incorporated herein by this reference in their entirety for all purposes. To the extent of any inconsistency or conflict in the definition or use of a term between any of the incorporated publications, documents or things and the text of the present document, the definition or use of the term in the present document shall prevail. 

1. A method of managing storage of data in a memory array that separately stores complete-page data and partial-page data comprising: storing received data in an on-chip cache in a peripheral area of the memory array prior to writing the received data to the memory array; and when a stop-transmission command is received prior to writing the received data to the memory array, and the received data is less than the data of a complete page, writing the received data to an area of the memory array that is dedicated to storage of partial-page data.
 2. The method of claim 1 wherein the stop-transmission command is a Secure Digital stop-transmission command from a host.
 3. The method of claim 1 wherein the received data is initially assigned to a first location in the memory array that is dedicated to storage of complete-page data, and when the stop-transmission command is received, the data is assigned to a second location in the area of the memory array that is dedicated to storage of partial-page data.
 4. The method of claim 3 wherein the data is assigned to the second location by a memory controller, and the data is transferred from the on-chip cache to the memory array through the memory controller.
 5. The method of claim 3 wherein the data is assigned to the second location by an on-chip circuit that is peripheral to the memory array.
 6. The method of claim 1 wherein the memory array stores data in multi-level cells that individually store more than one bit of data.
 7. The method of claim 1 wherein the memory array comprises two or more planes that are programmed in parallel.
 8. The method of claim 7 wherein the area of the memory that is dedicated to storage of partial-page data extends across the two or more planes.
 9. The method of claim 7 wherein the area of the memory that is dedicated to storage of partial-page data is limited to one plane of the two or more planes.
 10. A method of managing streams of data in a nonvolatile memory array that does not permit more than one programming operation per page without erase, comprising: maintaining a first area of the nonvolatile memory array for storage of complete pages of data; maintaining a second area of the nonvolatile memory array for storage of partial pages of data; storing a portion of data in a page buffer in the memory periphery before a determination is made whether to store the portion of data in the first area or the second area; subsequently, determining whether to store the portion of data in the first area or the second area; and writing the portion of data from the page buffer to the first area or the second area according to the determination.
 11. The method of claim 10 wherein the determining is performed in response to receiving a stop-transmission command.
 12. The method of claim 11 wherein the stop-transmission command is a Secure Digital stop-transmission command received from a host.
 13. The method of claim 10 wherein the nonvolatile memory array comprises two or more planes and the second area extends across all of the two or more planes.
 14. The method of claim 10 wherein the nonvolatile memory array comprises two or more planes and the second area is contained within one of the two or more planes.
 15. The method of claim 14 wherein determining whether to store the portion of data in the first area or the second area further comprises determining whether the data is complete-page data or partial-page data.
 16. The method of claim 15 wherein the determining whether the data is complete-page data or partial-page data further comprises detecting whether a stop-transmission has been received.
 17. A method of handling streaming data in a buffered nonvolatile memory comprising: receiving a stream of data from a host without a prior indication of the amount of data; copying a portion of the stream of data from a memory controller to an on-chip cache prior to identifying the portion of the stream of data as partial page data; while the portion of the stream of data is in the on-chip cache, receiving an indication from the host that requires the portion of the stream of data to be written to the memory array; in response to receipt of the indication, determining that the portion of the stream of data is partial-page data; in response to the determination, selecting an area of the memory array that is dedicated to storage of partial-page data; and writing the portion of the stream of data to the area.
 18. The method of claim 17 wherein the indication is a Secure Digital stop-transmission command.
 19. The method of claim 17 wherein the memory array comprises two or more planes and the area extends across all of the two or more planes.
 20. The method of claim 17 wherein the memory array comprises two or more planes and the area is limited to fewer than all of the two or more planes.
 21. The method of claim 20 wherein the area is limited to one of the two or more planes.
 22. A nonvolatile memory system comprising: a nonvolatile memory array that includes a first area for writing complete pages and a second area for writing partial pages; and an on-chip cache in a peripheral area of the memory array, the on chip cache storing data to be written to the nonvolatile memory array prior to determining whether to write the stored data to the first area or the second area.
 23. The nonvolatile memory system of claim 22 wherein the memory array comprises two or more planes.
 24. The nonvolatile memory system of claim 23 wherein the second area extends across all of the two or more planes.
 25. The nonvolatile memory system of claim 22 wherein the second area is contained within one of the two or more planes.
 26. The nonvolatile memory system of claim 22 further comprising a memory controller that sends data to be stored in the nonvolatile memory array, the memory controller in communication with a host.
 27. The nonvolatile memory system of claim 26 further comprising a memory card physical interface for communication with the host.
 28. The nonvolatile memory system of claim 26 wherein the memory controller transfers partial pages from the on-chip cache to the second area through a controller RAM in the memory controller. 